Search results for "semi-structured data"

showing 5 items of 5 documents

Building Semantic Trees from XML Documents

2016

International audience; The distributed nature of the Web, as a decentralized system exchanging information between heterogeneous sources, has underlined the need to manage interoperability, i.e., the ability to automatically interpret information in Web documents exchanged between different sources, necessary for efficient information management and search applications. In this context, XML was introduced as a data representation standard that simplifies the tasks of interoperation and integration among heterogeneous data sources, allowing to represent data in (semi-) structured documents consisting of hierarchically nested elements and atomic attributes. However, while XML was shown most …

Document Structure DescriptionComputer Networks and CommunicationsComputer sciencecomputer.internet_protocolSemantic analysis (machine learning)Efficient XML InterchangeInteroperabilityXML SignatureWord sense disambiguation02 engineering and technologycomputer.software_genreSemantic networkSemantic ambiguityXML Schema Editor020204 information systemsNode (computer science)0202 electrical engineering electronic engineering information engineering[INFO]Computer Science [cs]XML schemaContext representationcomputer.programming_languageXML treeInformation retrievalKnowledge basesSemi-structured dataXML validationcomputer.file_formatSemantic interoperabilityXMLHuman-Computer InteractionXML databaseSemantic similaritySemantic-aware processing020201 artificial intelligence & image processingWeb servicecomputerSoftwareXML

researchProduct

A novel XML document structure comparison framework based-on sub-tree commonalities and label semantics

2012

International audience; XML similarity evaluation has become a central issue in the database and information communities, its applications ranging over document clustering, version control, data integration and ranked retrieval. Various algorithms for comparing hierarchically structured data, XML documents in particular, have been proposed in the literature. Most of them make use of techniques for finding the edit distance between tree structures, XML documents being commonly modeled as Ordered Labeled Trees. Yet, a thorough investigation of current approaches led us to identify several similarity aspects, i.e., sub-tree related structural and semantic similarities, which are not sufficient…

Document Structure DescriptionComputer Networks and Communicationscomputer.internet_protocolComputer scienceEfficient XML Interchange[SCCO.COMP]Cognitive science/Computer science0102 computer and information sciences02 engineering and technologycomputer.software_genre01 natural sciencesSemantic similarityXML Schema Editor020204 information systems0202 electrical engineering electronic engineering information engineeringXML schemacomputer.programming_languageInformation retrieval[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB][INFO.INFO-WB]Computer Science [cs]/Web[INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]XML validationcomputer.file_formatDocument clusteringHuman-Computer InteractionXML frameworkTree (data structure)XML databaseTree structure010201 computation theory & mathematics[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]020201 artificial intelligence & image processingSemi-structured dataEdit distancecomputerSoftwareXMLXML CatalogData integration

researchProduct

XML document-grammar comparison: related problems and applications

2011

10.2478/s13537-011-0005-1; International audience; XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to th…

Document Structure DescriptionXML grammarXML Encryptionselective disseminationGeneral Computer ScienceComputer scienceEfficient XML Interchange[SCCO.COMP]Cognitive science/Computer scienceWell-formed document02 engineering and technologyWorld Wide WebXML Schema Editor[SCCO.COMP] Cognitive science/Computer science020204 information systemsStreaming XML0202 electrical engineering electronic engineering information engineeringPROCESSAMENTO DE IMAGENSXML schemacomputer.programming_languageInformation retrievalXSDgrammar evolutionXML validationstructural similarityQA75.5-76.95computer.file_formatXMLDTDclassificationElectronic computers. Computer science[ SCCO.COMP ] Cognitive science/Computer scienceComputingMethodologies_DOCUMENTANDTEXTPROCESSING020201 artificial intelligence & image processingsemi-structured datacomputerclusteringstructure transformation

researchProduct

Export of Relational Databases to RDF Databases: A Case Study

2010

The vast amount of business information nowadays is stored in relational databases. For the Semantic Web vision to become a reality, we need ways how to exploit this data in form of RDF triples. The universal and commonly accepted solution for this problem still does not exist. In most cases, mapping languages are used for specification of correspondences between OWL ontology and DB schema. At the same time, these languages generally are not well suited for specification of mappings in cases when there is a substantial difference between OWL ontology and DB schema. In this paper, we describe a new model transformation-based method for specification of correspondences between the elements of…

Information retrievalComputer scienceEntity–relationship modelDatabase schemaSPARQLWeb Ontology LanguageSemi-structured datacomputer.file_formatRDFcomputerInformation schemacomputer.programming_languageDatabase model

researchProduct

A Semantic Layer on Semi-structured Data Sources for Intuitive Chatbots

2009

The main limits of chatbot technology are related to the building of their knowledge representation and to their rigid information retrieval and dialogue capabilities, usually based on simple "pattern matching rules". The analysis of distributional properties of words in a texts corpus allows the creation of semantic spaces where represent and compare natural language elements. This space can be interpreted as a "conceptual" space where the axes represent the latent primitive concepts of the analyzed corpus. The presented work aims at exploiting the properties of a data-driven semantic/conceptual space built using semi-structured data sources freely available on the web, like Wikipedia. Thi…

Information retrievalKnowledge representation and reasoningbusiness.industryComputer scienceComputer Science::Information Retrievalcomputer.software_genreChatbotsemantic spaces chatbotSemantic similarityExplicit semantic analysisEncyclopediaSemi-structured dataPattern matchingArtificial intelligencebusinesscomputerNatural language processingNatural language

researchProduct